Performance Comparison of K-Means and Expectation Maximization with Gaussian Mixture Models for Clustering EE6540 Final Project

نویسندگان

  • Devin Cornell
  • Sushruth Sastry
چکیده

In this article, we explore the theoretical aspects of the expectation maximization algorithm and how it can be applied to estimation of data as a Gaussian mixture model. We form comparisons and show how by the simplification of some parameters we can form the heuristic k-means algorithm. We then demonstrate through the authors’ code several situations where the EM-GMM algorithm performs significantly better than k-means and touch upon potential implications of these findings for arbitrary distribution approximation. Finally, we form some conclusions about scenarios where EM-GMM should be used over k-means and how to use them for classification.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comparison of clustering methods: A case study of text-independent speaker modeling

Clustering is needed in various applications such as biometric person authentication, speech coding and recognition, image compression and information retrieval. Hundreds of clustering methods have been proposed for the task in various fields but, surprisingly, there are few extensive studies actually comparing them. An important question is how much the choice of a clustering method matters fo...

متن کامل

T An Efficient Approach for Medical Image Segmentation Based on Truncated Skew Gaussian Mixture Model Using K - Means Algorithm

In this paper, we proposed a novel approach for medical image segmentation process based on Finite Truncated Skew Gaussian mixture model. This approach considers various issues like skewness and asymmetric distributions with a finite range. We have utilized the Expectation-Maximization (EM) algorithm in estimating the final model parameters and K-Means algorithm is utilized to estimate the numb...

متن کامل

Non-parametric Mixture Models for Clustering

Mixture models have been widely used for data clustering. However, commonly used mixture models are generally of a parametric form (e.g., mixture of Gaussian distributions or GMM), which significantly limits their capacity in fitting diverse multidimensional data distributions encountered in practice. We propose a non-parametric mixture model (NMM) for data clustering in order to detect cluster...

متن کامل

Robust Method for E-Maximization and Hierarchical Clustering of Image Classification

We developed a new semi-supervised EM-like algorithm that is given the set of objects present in eachtraining image, but does not know which regions correspond to which objects. We have tested thealgorithm on a dataset of 860 hand-labeled color images using only color and texture features, and theresults show that our EM variant is able to break the symmetry in the initial solution. We compared...

متن کامل

Face recognition system based on Doubly truncated multivariate Gaussian Mixture Model

A face recognition algorithm based on doubly truncated multivariate Gaussian mixture model with DCT is introduced. The truncation on the feature vector with a significant influence on improving the recognition rate of the system using EM algorithm with K-means or hierarchical clustering is implemented. The characteristic model parameters are estimated. The EM algorithm containing the updated eq...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015